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I. INTRODUCTION 

The microcanonical and canonical ensembles are the 
two main probability distributions with respect to which 
the equilibrium properties of statistical mechanical mod- 
els are calculated. As is well known, the microcanoni- 
cal ensemble is a statistical mechanical expression of the 
conservation of the energy for a closed or isolated sys- 
tem whereas the canonical ensemble models a system in 
thermal interaction with a heat reservoir having a con- 
stant temperature P, Q- Although the two ensembles 
model two different physical situations, it is widely as- 
sumed that the ensembles give equivalent results in the 
thermodynamic limit; i.e., in the limit in which the vol- 
ume of the system tends to infinity. The typical argument 
used to motivate this equivalence is given in the classic 
text of Landau and Lifshitz (3. Ahhough "the canonical 
distribution is 'spread' over a certain range of energies," 
"the width of this range ... is negligible for a macroscopic 
body." The conclusion is that, in the thermodynamic 
limit, the canonical ensemble can be considered to be an 
ensemble of fixed mean energy or, in other words, a mi- 
crocanonical ensemble. The texts P, 0, IE IE contain 
similar arguments. 

In order to substantiate the argument of Landau and 
Lifshitz, it must be proved that the energy per particle 
converges to a constant in the infinite- volume limit of the 
canonical ensemble. This convergence can be proved to 
hold for noninteracting systems such as the perfect gas 
and for a variety of weakly interacting systems. For gen- 
eral systems, however, neither is this convergence valid 
nor is the conclusion true concerning ensemble equiva- 
lence which this convergence is intended to motivate. In 
fact, in the past three and a half decades, numerous sta- 



tistical models have been discovered having microcanoni- 
cal equilibrium properties that cannot be accounted for 
within the framework of the canonical ensemble. This 
lack of correspondence between the microcanonical and 
canonical ensembles has profound consequences for it im- 
plies that one is forbidden to substitute the mean-energy 
variable for the temperature variable, and vice versa, 
when parameterizing the equilibrium properties of sys- 
tems. In such cases of nonequivalence, the questions of 
determining which of the two ensembles is the more fun- 
damental and which ensemble is the one realized in the 
laboratory are of fundamental interest. 

Until now, the phenomenon of nonequivalent ensem- 
bles has been identified and analyzed almost exclusively 
by determining regions of the mean energy where the 
microcanonical entropy function is anomalously noncon- 
cave or by determining regions of the mean energy where 
the heat capacity, calculated microcanonically, is nega- 
tive. This thermodynamic approach to the problem of 
nonequivalent ensembles has been propounded by a num- 
ber of people, including Lynden-Bell and Wood @ , who 
in the 1960's were among the first to observe negative 
heat capacities in certa in grav itational many-body sys- 
tems (see also |llTollTllll1lllTi|l More recently, non- 
concave anomalies in the microcanonical entropy as well 
as negative heat capacities have been observed in models 
of fluid turbulence 

EE EE III m and models of plas- 
mas 19J, in addition to long-range and mean- field spin 
models, including the mean-field XY model and the 
mean-field Blume-Emery-Griffiths (BEG) model 

The existence of such nonconcave anomalies invalidates 
yet another tacit principle of statistical mechanics which 
states that the one should always be able to express the 
microcanonical entropy, the basic thermodynamic func- 
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tion for the microcanonical ensemble, as the Legendre- 
Fenchel transform of the free energy, the basic thermo- 
dynamic function for the canonical ensemble. Indeed, 
if the microcanonical entropy is to be expressed as the 
Legendre-Fenchel transform of the canonical free energy, 
then the former function must necessarily be concave on 
its domain of definition. Hence, if the microcanonical 
entropy has nonconcave regions, then expressing it as a 
Legendre-Fenchel transform is impossible. When this oc- 
curs, we say that there is thermodynamic nonequivalence 
of ensembles. 

For the BEG model, the existence of nonconcave 
anomalies in the microcanonical entropy was shown by 
Barre, Mukamel and Ruffo via a Landau expansion of 
this quantity |2lj |. The aim of the present paper is to 
extend their study of nonequivalent ensembles by show- 
ing how the thermodynamic nonequivalence of the mi- 
crocanonical and canonical ensembles for this model re- 
flects a deeper level of nonequivalence that takes place 
at the statistical level of its equilibrium distribution of 
states. We carry this out via numerical calculations 
both at the thermodynamic level and at the statistical 
level, illustrating, in particular, a striking statistical con- 
sequence of the nonconcavity of the microcanonical en- 
tropy. Namely, we demonstrate that if the microcanoni- 
cal entropy is nonconcave at a mean-energy value u — i.e., 
if thermodynamic nonequivalence of ensembles holds — 
then nonequivalence of ensembles holds at the statistical 
level in the following sense: none of the equilibrium sta- 
tistical distributions of states calculated in the microca- 
nonical ensemble at that value of u can be realized in the 
canonical ensemble. Furthermore, we demonstrate that if 
the microcanonical entropy is strictly concave at u — i.e., 
if thermodynamic equivalence of ensembles holds — then 
all the statistical distributions of states found in the mi- 
crocanonical ensemble at that value of u can be put in 
one-to-one correspondence with equivalent distributions 
of the canonical ensemble. 

Our numerical flndings illustrate clearly and directly 
a number of results on equivalence and nonequivalence 
of ensembles that are valid for a wide class of statisti- 
cal mechanical systems. These results, derived recently 
by Ellis, Haven and Turkington, establish different lev- 
els of correspondence between microcanonical equilib- 
rium macrostates and canonical equilibrium macrostates 
in terms of concavity properties of the microcanonical 
entropy. These results, as well as an overview of applica- 
tions to turbulence, can be found in their comprehensive 
paper . A speciflc application to a model of geophys- 
ical fluid turbulence is treated in their paper 18] . 

The organization of the present paper is as follows. In 
Section ^ we introduce the basic thermodynamic func- 
tions in the two ensembles, the microcanonical entropy 
and the canonical free energy. We also motivate the defi- 
nitions of the sets of equilibrium statistical distributions 
of states for the two ensembles in the thermodynamic 
limit. This motivation is based in part on the theory of 
large deviations, a branch of probability that studies the 



exponential decay of probabilities and is perfectly suited 
for analyzing asymptotic properties of the two ensembles. 
In Section UTTI we discuss the equivalence and nonequiva- 
lence of ensembles first at the thermodynamic level and 
then at the statistical level. In Section IIVI we finally 
present our numerical calculations that illustrate ther- 
modynamic and statistical equivalence and nonequiva- 
lence of ensembles for the BEG model. Because our goal 
is to emphasize the physical ideas, we have omitted al- 
most all mathematical details and have occasionally com- 
promised perfect mathematical accuracy when it benefits 
the exposition. For complete mathematical details con- 
cerning equivalence and nonequivalence of ensembles and 
the theory of large deviations, the reader is referred to 



II. THERMODYNAMIC ENSEMBLES AND 
LARGE DEVIATIONS PRINCIPLES 

A. Microcanonical ensemble 

The main quantity characterizing the thermodynamic 
properties of statistical mechanical systems in the micro- 
canonical ensemble is the entropy function. This quan- 
tity is defined in terms of the probability measure of all 
microstates of a system having the same value of the 
mean energy 0,0. To be precise, suppose that the 
system in question is composed of n particles, and de- 
note the microstates of that system by the joint state 
x" = {xi,X2; ■ ■ ■ tXji), where Xi represents the state of 
the z**^ particle taking values in some state space X. The 
set of all microstates is the n-fold product X"". Moreover, 
let us denote by U{x^) the energy or Hamiltonian of the 
system as a function of the microstates a;" and by u{x^) 
the mean energy or energy per particle, defined as 

u{x-^) = (1) 
n 

In terms of this notation the microcanonical entropy 
function s{u) is defined as 

s{u) = lim -\nP{u{x") G du}, (2) 

n — >OG Ti 

where 

P{u(a;") e du} = / P(da;") (3) 

is the probability measure of all microstates x" lying in 
the infinitesimal mean-energy ball du centered at u (23j |. 
The probability measure Pidx^) in ^ is the a priori 
measure on X^ , which is taken to be the uniform mea- 
sure in accordance with Boltzmann's equiprobability hy- 
pothesis In order for this probability measure to be 
well defined, the configuration space X"" is assumed to be 
bounded. As we will soon explain, conditioning P on the 
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set of configurations {a;" : u(a;") G du} defines the micro- 
canonical ensemble. We call this set the microcanonical 
conditioning set. 

We find it convenient to re-express the definition of 
s{u) in © via the formula 

P{u(a;") e dw} X e"'("). (4) 

This is done in order to emphasize the facts that 
P{u{x^) e du} has, to a first degree of approxima- 
tion, the form of an exponential that decays with n and 
that the exponential decay rate is the microcanonical 
entropy. The heuristic sign "x" is used here instead 
of the approximation sign in order to stress that 
the dominant term describing the asymptotic behavior 
of P{u{x"') € du}, as n — *■ oo, is the exponential func- 
tion e"'*("). 

In the theory of large deviations, the exponential 
asymptotic property of thermodynamic probability mea- 
sures is referred to as a large deviation principle (LDP). 
Detailed explanations of this theory are available in a 
number of references including [23, HE HE HE HI ■ For 
our purpose it is important to note that an LDP can 
be formulated not only for the energy per particle, but 
also for more general macroscopic variables such as the 
total spin per particle or the vector of occupation num- 
bers. As we point out in the next paragraph, the LDP 
is the basic tool for deriving the well known variational 
maximum-entropy principle characterizing the equilib- 
rium macrostates of the associated macroscopic variables 
in the microcanonical ensemble. 

The macroscopic variables that we consider are quan- 
tities L{x^) that take values in a space of macrostates £, 
and that are assumed to satisfy an LDP expressed by 

P{L(x") edL} = f P{dx'') (5) 

I 



In this formula L is an element of L, and P{i(a;") G dL} 
represents the probability measure of all microstates x" 
such that L{x'^) lies in the infinitesimal ball dL with cen- 
ter L. In addition, we assume that the energy per par- 
ticle ulx") can be written asymptotically as a function 
of L(a:") in the following sense. There exists a bounded, 
continuous function u(L), called the energy representa- 
tion function, with the property that M(a;") = u(L{x")) 
for all microstates a;" or, more generally, that 

|w(L(a;")) -u(a;")| ^0 (6) 

uniformly over all microstates x"" as n —>■ go. In Section 
IIVI we give a concrete example, for the BEG model, of 
a macroscopic variable L(x") for which assumptions Q 
and © are valid. In the case of the BEG model, the 
energy representation will in fact be seen to be exact 
in the sense that u(a:") — u{L{x")) for all x" and n. 
However, the properties discussed in this paper are valid 
also for models for which the limit jnj holds. 

Under assumptions © and © on the macroscopic 
variable L{x^), it is easily seen that the most probable 
macrostates L for configurations lying in the microcano- 
nical conditioning set {x" : u{x'^) € du} are those that 
maximize the entropy function s{L) subject to the energy 
constraint u{L) = u These constrained maximizers 
of s{L) compose the set f " of microcanonical equilibrium 
macrostates associated with a given mean-energy value 
u; in symbols, 

£^ — {L : L maximizes s{L) with u{L) — u}. (7) 



The physical importance of equilibrium macrostates in statistical mechanics stems from the fact that any macrostate 
not in has an exponentially small probability of being observed given that the energy per particle of the system 
is fixed at the value u. This can be seen by introducing the microcanonical probability measure P", which is 
defined by conditioning the a priori measure P on the microcanonical conditioning set H3; i^i symbols, P^{dx^) = 
P{dx" I u(x") G du}. Thus for L G C we have 

P"{i(a;") G dL} = P{L{x") G dL \ u{x") G du} (8) 
P{{a;" : L(a;") G dL} n {x" : m(x") G du}} 
^ P{ii(x") G du} 



As shown in Hl|, LDP's and (|3J) and the asymp- 
totic relationship © yield the LDP 



P"{L(x") G dL} 



where 



s(u) — s{L) if u{L) = u 



oo 



otherwise. 



(9) 



(10) 



By the general theory of large deviations, I^{L) is non- 
negative for any macrostate L € C. Hence if I^{L) > 0, 
then the microcanonical probability that L(x") is near 
L goes to 0, as n — > cxD, at the exponential decay rate 
I'^{L). This observation motivates our definition of the 
set f " of microcanonical equilibrium macrostates to be 
the set of macrostates L for which I'^{L) attains its min- 
imum of 0, and thus for which s{L) is maximized under 
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the constraint u{L) = u. As a consequence of this defi- 
nition, we have the following variational formula for the 
microcanonical entropy: 



In this formula, Zn{P) is the n-particle partition function 
defined by 



s(u) = sup s{L) = s(L"), 

{L:u{L)=u} 

where L" is any macrostate contained in f 



(11) 



(13) 



B. Canonical ensemble 

While the microcanonical ensemble is defined in terms 
of a fixed value of the mean energy it, the canonical en- 
semble is defined in terms of a fixed value of the inverse 
temperature /3. In the canonical ensemble the relevant 
probability measure on X" is the Gibbs measure 



in terms of which we define the canonical free energy 

ip{(3) = - lim -lnZ„(/?). (14) 



1 



(12) 



This last quantity plays an analogous role in the asymp- 
totic analysis of the canonical ensemble as the microca- 
nonical entropy plays in the asymptotic analysis of the 
microcanonical ensemble. 



J 



As we did in the case of the microcanonical ensemble, we now state the LDP for L(x") with respect to the canonical 
ensemble and then use this principle to define the set of canonical equilibrium macrostates. For any macrostate L 
and any microstates satisfying L{x"') € dL, the continuity of the energy representation function implies that u{x"') 
is close to u{L). Hence, we expect that 



P/3{i(a;") edL} ^ [ Ppidx'') 

J {x":L(x'^)edL} 



(15) 



is close to 



-0nu{L) 



{x"':L{x^)edL} 



P(da;") = 



ZniP) 



-l3nu{L) 



P{x" : i(a;") G dL}. 



(16) 



Substituting into this formula the large deviation esti- 
mate © for P{L{x'^) e dL} and the hmit (d)) relating 
Zn{(3) and (p{P), we obtain the LDP 

P/3{L(x") e dL} X e-"^^(^), (17) 

where 

IpiL)^Pu{L)-siL)~ip{P). (18) 

The function Ip{L) is nonnegative for any macro- 
state L. As in the case of the microcanonical ensem- 
ble, for any macrostate L satisfying Ip{L) > the LDP 
(|17|l shows that the corresponding canonical probability 
Pp{L{x'"^) G dL} converges to exponentially fast. As 
a result, such macrostates are not observed in the ther- 
modynamic limit. The conclusion that we draw from the 
LDP (|17|l is that, with respect to the Gibbsian probabil- 
ity measure Pp, the most probable macrostates are those 
for which Ip{L) attains its minimum of or, equivalently, 
those for which the quantity f3u{L) — s{L) is minimized 



for a fixed value of [3. Accordingly, the set of canoni- 
cal equilibrium macrostates associated with a fixed value 
of /3 is defined by 

8p = {L : (3u{L) — s{L) is minimized}. (19) 

We end this section by motivating the existence of the 
limit H14|l that defines the canonical free energy (p{P). 
In the definition H13|l of Z„(/3), we first use © to re- 
place the energy per particle u(a;") by u{L{x")). Since 
the macroscopic variable L(x") takes values in C, we can 
rewrite the resulting expression for Zn{P) not as an in- 
tegral over the set X" of microstates, but as an integral 
over C. Applying the LDP © for P{L{x"-) G dL), we 
obtain 

Z„(/3) = J e-^""(-^)F{i(a;") G dL} (20) 
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exp 



-n inf {BuiL) 



<L)} 



The last step is a consequence of Laplace's method, which 
states that, as n — > oo, the main exponential contribu- 
tion to the integral comes from the largest value of the 
integrand or equivalently the smallest exponent 0, . 
These calculations motivate the variational formula 



m = ini {f3u{L) - s{L)} 



(21) 



where is any member of Efj. We call this the macro- 
state representation of (p((3)- Formula (|21() can be derived 
rigorously from the LDP (jSJ using Varadhan's theorem 

We denote by hi the set of mean energy values u for 
which s{u) > —oo. Any u d 14 is called an admissible 
value of the mean energy. Using a similar chain of argu- 
ments as in the preceding paragraph, one can also write 

Z„(/3) = [ e-'3""P{u(x") e du} (22) 
Ju 



exp — n inf — s{u)} 



This asymptotic formula motivates the fundamental re- 
lationship 



^(/3) = mU(3u - siu)}, 



(23) 



which expresses the thermodynamic free energy ip{P) as 
the Legendre-Fenchel transform of the microcanonical en- 
tropy s(u) j23l] . We call this formula the thermodynamic 
representation of ip{l3). 

The same formula for ip{(3) can also be derived by 
rewriting the infimum over L in (|21|l as an infimum over 
all mean-energy values u followed by a constrained in- 
fimum over all L satisfying u{L) — u. Using we 
obtain 

^(/3) = inf inf {Pu{L)-s{L)} (24) 

tiGM {L:u(L)=u} 



= inf <^ Bu - inf {s{L)] 

ueU [ {L:u{L)=u} 

= mi{Pu-s{u)}. 



In the case where s{u) is a strictly concave differentiable 
function of u and /3 is in the range of s' , this Legendre- 
Fenchel transform reduces to the usual differential form 
of the Legendre transform. This is given by 



^(/3) = (3u{P) - s(u(/3)). 



(25) 



where u{(3) is the unique solution of the equation /3 = 
s'{u). 



III. EQUIVALENCE AND NONEQUIVALENCE 
OF ENSEMBLES 

A. Thermodynamic level 

The problem of the equivalence or nonequivalence of 
the microcanonical and canonical ensembles at the ther- 
modynamic level is fundamentally related to the prop- 
erties of the Legendre-Fenchel transform, and especially 
its invertibility or noninvertibility properties as a func- 
tional transform. Equation 123|l expresses ifiiP) as the 
Legendre-Fenchel transform of s{u). A basic question is 
whether one can invert this Legendre-Fenchel transform 
by applying the same transform to (p{/3) so as to obtain 



s{u) 



inf{/3u 



(26) 



We claim that such an inversion of the Legendre- 
Fenchel transform is valid if and only if s{u) is concave 
on its domain of definition. The sufficiency can be seen 
by introducing the function 



s**{u)=M{Pu-ipiP)}, 



(27) 



which is concave on its domain of definition and equals 
the minimal concave function majorizing s{u) for all u 
[23, El- We call s**{u) the concave hull of s{u) and de- 
pict it in Figure n It follows that if s{u) is concave on 
its domain of definition, then s and s** coincide. Replac- 
ing s**{u) by s{u) in H27|) expresses the microcanonical 
entropy s(u) as the Legendre-Fenchel transform of the 
free energy ^p{l3). Conversely, since any function written 
as the Legendre-Fenchel transform of another function 
is automatically concave on its domain of definition, it 
follows that if s{u) is not concave on its domain of defini- 
tion, then it cannot be written as the Legendre-Fenchel 
transform of the free energy. 

This discussion motivates the following definitions. We 
define the two ensembles to be thermodynamically equiv- 
alent at u if s{u) — s**{u); in this case we say that 
s is concave at u. In the opposite case — namely, if 
s{u) ^ s**(u) — we call the two ensembles thermodynam- 
ically nonequivalent at m.|34| If s{u) ^ s**{u), then s is 
said to be nonconcave at u. For later reference we say 
that s is strictly concave at u if s{u) — s**{u) and s** is 
strictly concave at u in the sense that the graph of s** is 
not flat around u. 

The noninvertibility of the Legendre-Fenchel transform 
for nonconcave functions has no effect on how ip is to be 
calculated from s. The canonical free energy (p{P) is al- 
ways a concave function of the inverse temperature /3; 
regardless of the form of s{u), ^p{(3) can always be ex- 
pressed via the fundamental Legendre-Fenchel relation- 
ship H23|l . As we have just seen, however, s{u) can be 
expressed as the Legendre-Fenchel of (p(/3) if and only if 
s{u) is concave on its domain of definition. In this sense, 
the microcanonical ensemble is more fundamental than 
the canonical ensemble. 
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FIG. 1: (Left) Plot of a nonconcave entropy function s(u) together with its concave envelope s**(u). The nonconcavity region 
equals the open interval {ui,Uh)- (Right) The corresponding free energy function ^(/3) obtained by calculating the Legendre- 
Fenchel transform of s(it). The region of nonconcavity of s{u) is signalled by the appearance of a point /3c where fiP) is 
nondifferentiable. The quantity /3c equals the slope of the affine part of s**{u), and the left-hand and right-hand derivatives of 
</9 at /3c equal Uh and ui, respectively. 



This apparent superiority of the microcanonical en- 
semble over the canonical ensemble does not prevent us 
from deriving a criterion based entirely on the canonical 
ensemble for verifying that the two ensembles are ther- 
modynamically equivalent. Indeed, suppose that '^{(3) 
is differentiable for all j3. Then the Gartner-Ellis The- 
orem guarantees that, with respect to the a priori mea- 
sure P, the energy per particle u(x") satisfies the LDP 
with entropy function siu) given by the Legendre-Fenchel 
transform of ip{P) 0,123 • Because (p(/3) is assumed to 
be everywhere differentiable, the general theory of these 
transforms guarantees that s{u) is strictly concave on 
its domain of definition. We conclude that if ip{0) is 
everywhere differentiable, then thermodynamic equiva- 
lence of ensembles holds for all admissible values of the 
mean energy. "SS*! This can be expressed in more physical 
terms by saying that the absence of a first-order phase 
transition in the canonical ensemble implies that the en- 
sembles are equivalent at the thermodynamic level. Un- 
fortunately, the converse statement does not hold as the 
nondifferentiability of (p at some (i corresponds to one 
of the following: either s{u) is not concave over some 
range of mean-energy values or s{u) is concave, but not 
strictly concave, over some range of mean-energy values 
(see Figure P). 

In the next subsection we examine the equivalence and 
nonequi valence of ensembles at a higher level; namely, 
that of equilibrium macrostates. Among other results 
we will see that the strict concavity of s at some mean- 
energy value u implies a strong form of equivalence that 
we call full equivalence. 



B. Macrostate level 

At the level of equilibrium macrostates, the natural 
questions to consider for characterizing the equivalence 
or nonequivalence of the microcanonical and canonical 
ensembles are the following. For every /3 and every 
in the set of canonical equilibrium macrostates, does 



there exist a value of u such that Lp lies in the set £^ of 
microcanonical equilibrium macrostates? Conversely, for 
every u and every L" G does there exist a value of 
fi such that ^ £pl In trying to relate the macrostate 
level of equivalence and nonequivalence of ensembles with 
the thermodynamic level of equivalence and nonequiva- 
lence, we also ask the following. Are there thermody- 
namic conditions expressed in terms of properties of s{u) 
or (/?(/?) ensuring a correspondence or a lack of corre- 
spondence between the members of f " and those of £pl 
In particular, does equivalence of ensembles at the ther- 
modynamic level implies equivalence of ensembles at the 
level of equilibrium macrostates? 

In Ellis, Haven, and Turkington have provided 
precise answers to these questions expressed in terms 
of relationships between the solutions of the constrained 
minimization problem Q that characterizes the set f " 
of microcanonical equilibrium macrostates and the so- 
lutions of the dual unconstrained minimization problem 
(|19|l that characterizes the set £p of canonical equilib- 
rium macrostates. Their main results are summarized 
in items 1-4 below; they apply in general to any statis- 
tical mechanical model that have macroscopic variables 
L(a;") satisfying an LDP as in (O with an entropy func- 
tion s{L) and that have an energy representation function 
u{L) satisfying ©. In order to simplify the presentation, 
we consider only mean-energy values u lying in the inte- 
rior of the domain of definition of s, and we assume that 
s is differentiable at all such u. The reader is referred to 
23] for complete proofs of more general results that hold 
under weaker assumptions. 

1. Canonical equilibrium macrostates can always be 
realized microcanonically. Let /3 be given. Then 

U f", (28) 

where u(£fj) denotes the set of mean-energy values 
u that can be written as u{L) for some L ^ £j3. 
Formula H28|l implies that for any macrostate -L/3 £ 
£l3 there exists u e u{£p) such that Ljj £ 
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2. Full equivalence. If s is strictly concave at u, then 

= for /3 = s'(m). 

3. Partial equivalence. If s is concave at u but not 
strictly concave, then C £p for /3 = i.e., 
f " is a proper subset of S/s which does not coincide 
with £/3. According to items 2 and 3, thermody- 
namic equivalence of ensembles for some value of u 
implies either full or partial equivalence of ensem- 
bles at the level of equilibrium macrostates for that 



4. Nonequivalence. If s is nonconcave at u, then 
£'"n£^/3 = for all (3. In other words, if there is ther- 
modynamic nonequivalence of ensembles for some 
value of u, then the microcanonical equilibrium 
macrostates corresponding to that u are nowhere 
realized within the canonical ensemble. 

We spend the remainder of this section sketching part 
of the proof of full equivalence as stated in item 2 and 
proving nonequivalence as stated in item 4. The crucial 
insight needed to prove that s[u) = s**{u) impHes = 
Ep with (3 = s'{u) is provided by a basic result of convex 
analysis which states that 



s{v) < s{u) + l3{v — u) 



(29) 



for all V if and only if s{u) — s**{u) and f3 — s'(u). 
Accordingly, if we suppose that s{u) — s**(u), then 



Pu — s{u) < Pv — s(v) 



(30) 



for all V. Using the thermodynamic representation of 
fiP) in and the macrostate representation of ip{(3) 
in 1211) . we see that 



l3u — s{u) < inf {I3v — s{v)} 



(31) 



= mf {/3«(L) - s(L)}. 

Now choose any G by the definition of this set 
u{L'^) = u, and s(i") = s{u). This allows us to write 

pu(V') - siL"") < M^{(3u{L) - s{L)}. (32) 

We deduce that minimizes (3u{L) — s{L) or equiva- 
lently that L" G Ep. Since L" is an arbitrary element 
of it follows that £"■ C £p. One completes the proof 
that £"■ — £p by showing that if s is strictly concave at 
M, then is not a proper subset of Ep. 

To prove the assertion on nonequivalence of ensembles 
given in item 4, we proceed in a similar fashion. The 
assumption that s is nonconcave at u implies that 



s(u) < s**(u) — inf {7U — (/3(7)} 
7 

< /3u-(^(/3) 



(33) 



for all (3. This can be rewritten as 

Pu - s{u) > ip{P). 



(34) 



Now choose any e £^ and any /3. Since u{L^) = u 
and s{L'^) — s{u), it follows that 

/?u(i") - s{L") > if {(3) ^ M^{Pu{L) - s{L)}. (35) 

This shows that is not a minimizer of Pu(L) — s{L) 
and thus that L" ^ £p. Since L" is an arbitrary element 
of £^ and P is arbitrary, we conclude that £"0 5/3 = 
for all P, as claimed. 



IV. ILLUSTRATION OF THE RESULTS FOR 
THE BLUME-EMERY-GRIFFITHS MODEL 

We now come to the main point of our study which is 
to illustrate, in the context of a simple spin model, the 
general results presented in the previous section about 
macrostate equivalence and nonequivalence of ensembles 
and the relationship with thermodynamic equivalence 
and nonequivalence. The model that we consider for 
this purpose is a slight variant of the mean-field Blume- 
Emery-Grifhths (BEG) model defined by the following 
Hamiltonian |21i,|23||: 




(36) 



In this formula Xi represents a spin variable at site i tak- 
ing values in the set X — { — 1,0,-1-1}, and if is a positive 
real constant. The a priori measure for the model is de- 
fined by 

P(x") = \ for every a;" G A:"". (37) 

The macroscopic variable that we use to investigate 
the equivalence and nonequivalence of the microcanonical 
and canonical ensembles is the empirical vector L{x^) 
defined as 

L(x") = (L_i(x"),Lo(a;"),i+i(x")), (38) 
where for j = —1, 0, +1 



n 



(39) 



In physics, this quantity is often referred to as the one- 
particle state distribution or statistical distribution; its 
three components L_i(a;"), ^0(2;"), and L+i(a;") give 
the proportion of spins in the microstate a;" that take 
the respective values —1, 0, and -1-1. Accordingly, L{x") 
takes values in the space £ of probability vectors L = 
(L_i, io, i+i) defined by the conditions Lj > for j = 
-1, 0, +1, and L_i -I- Lo + L+i = 1. 
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For L G C we define the function 

u{L) = L+, + L_,-K{L+,-L_,f. (40) 

The choice of the empirical vector for studying equiva- 
lence and nonequivalence of ensembles is dictated by the 
fact that we can represent the energy per particle 

= ^ = - E - f - E V (41) 

i=i \ i=i / 

in the form 

u{xn = E^.(^")j"-^(E^^(^")-?| (42) 
jex \jex J 

= L+i(x") + L_i(x") - X(L+i(x") - L-i(x"))2 

This display, which holds exactly for all and n, veri- 
fies assumption 10 for the BEG model with the energy 
representation function u{L) defined in (|40|) . Moreover, 
one shows either by a combinatorial argument based on 
Stirling's approximation O'" from Sanov's Theo- 

rem [23, 123 that with respect to the a priori probability 
measure P, L{x'^) satisfies an LDP of the form Q with 
the entropy function 

s{L)^-Y,L,\nL., -In?,. (43) 

These properties of X(a;") allow us to characterize the 
equilibrium macrostates with respect to each ensemble 
as solutions of an appropriate optimization problem. In 
order to simplify the notation, the components of prob- 
ability vectors L G C will be written as (L_,Lo7^+) in- 
stead of as (L_i, io, L+i). We first consider the set of 
microcanonical equilibrium empirical vectors L" associ- 
ated with the mean energy u. According to the definition 
(|7|l. the equilibrimn macrostates L" are characterized as 
maximizers of s(L) over the space C subject to the con- 
straint u{L'^) = u. Solving this problem necessitates only 
the maximization of a function of one variable since the 
normalization constraint on the components of the em- 
pirical vector reduces the number of independent com- 
ponents of to two, while the microcanonical energy 
constraint reduces this number by one more. The set 
EjS of canonical equilibrium empirical vectors parameter- 
ized by the inverse temperature (3 is defined in H19(l . The 
elements Lp of this set are characterized as maximizers 
of the quantity (3u{L) — s{L) over C. In this case, we 
are faced with an unconstrained two-dimensional maxi- 
mization problem involving the two components Lj^ and 

In Figure[5]we present a first set of solutions for £^ and 
£p corresponding to the value K — 1.1111, together with 
a plot of the derivative of the microcanonical entropy 
function s{u). Because neither of the two optimization 



problems involved in the definitions of 5" and could be 
solved analytically, we provide numerical results obtained 
using various routines available in the scientific software 
Mathematica. The top left plot of Figure|21showing s'{u) 
was obtained by calculating an empirical vector G 
which satisfies u{L'^) — u and s(L") ~ s{u). The top 
right and the bottom left plots display, respectively, the 
canonical and microcanonical equilibrium components of 
the empirical vector as a function of the parameters (3 
and u defining each of the two ensembles. In the top 
right plot, the solid curve can be taken to represent the 
spin -|-1 component of the equilibrium empirical vector 
L/3, while the dashed curve can be taken to represent 
the spin —1 component of the same equilibrium empiri- 
cal vector. Since the BEG Hamiltonian satisfies the ex- 
change symmetry L+ L_, the roles of the solid and 
dashed curves can also be reversed. For (3 < Pc, the 
solid curve represents the common value of L_|_ = L_. 
In all cases, the component Lq of Lp is determined by 
Lq = 1 — i+ — i-. The same explanation applies to the 
bottom left plot of L". 

The first series of plots displayed in Figure [3 were 
designed to illustrate a case where s{u) is concave and 
where, accordingly, we expect equivalence of ensembles. 
That the equivalence of ensembles holds in this case at 
the level of the empirical vector can be seen by noting 
that the solid and dashed curves representing the and 
L_ components of Lp in the top right plot can be put 
in one-to-one correspondence with the solid and dashed 
curves representing the same two components of L" in 
the bottom left plot. The one-to-one correspondence is 
defined by the derivative of the microcanonical entropy 
s{u): for a given u we have L" = Lp(^u) with /3(m) = s'{u). 
Moreover, since the monotonic function s'(u) can be in- 
verted to yield a function u{f3) satisfying s'(u(/?)) = /3, 
we have Lp — L^'^^'^ for all (3. Thus, the equilibrium 
statistics of the BEG model in the microcanonical en- 
semble can be translated unambiguously into equivalent 
equilibrium statistics in the canonical ensemble and vice 
versa. In this case, the critical mean energy at which 
the BEG model goes from a high-energy phase of zero 
magnetization m{L) = — L_ to a low-energy phase 
of nonzero magnetization in the microcanonical ensem- 
ble can be calculated from the viewpoint of the canonical 
ensemble by finding the critical inverse temperature Pc 
that determines the onset of the same phase transition 
in the canonical ensemble. Since the two ensembles are 
equivalent, both the microcanonical and canonical phase 
transitions must be of the same order, which in this case 
is second-order. 

In the second series of plots in Figure |31 of en- 

semble nonequivalence corresponding to the value K = 
1.0817 is shown. Since in the top left plot s'{u) is not 
monotonic, s{u) is not concave. As in Figure^ the open 
interval {ui,Uh) of mean-energy values is the interval on 
which s{u) ^ s**(u); on this interval s{u) is nonconcave 
and s**{u) is affine with slope /3c. By comparing the 
top right plot oi Lp and the bottom left plot of L", we 



9 



0.378 0.3782 0.3784 0.3786 0.3788 0.379 0.17 0.18 0,19 0.2 0.21 



s'{u) 





89 



/3 



V' 0.19 




0.18 



0.378 0.3782 0.3784 0.3786 0.3788 0.379 



FIG. 2: Full equivalence of ensembles for the BEG model with K = 1.1111. (Top left) Derivative of the microcanonical entropy 
s(it). (Top right) The components L+ and L_ of the equilibrium empirical measure Lp in the canonical ensemble as functions 
of p. For (3 > Pc the solid and dashed curves can be taken to represent L+ and L_, respectively, and vice versa. (Bottom left) 
The components L+ and L- of the equilibrium empirical measure L" in the microcanonical ensemble as functions of it. For 
u < Uc the solid and dashed curves can be taken to represent L+ and L-, respectively, and vice versa. 



see that the elements of f " cease to be related to ele- 
ments of £f3 for all mean-energy values u in the interval 
(u;,u/i). In fact, for any u in this interval of thermody- 
namic nonequivalence of ensembles (shaded region) no Lp 
exists that can be put in correspondence with an equiv- 
alent equilibrium empirical vector contained in . This 
lack of correspondence agrees with the rigorous results 
of Ellis, Haven, and Turkington 23] reviewed in Section 
mil Thus, although the equilibrium macrostates L" cor- 
responding to M £ (ui,Uh) are characterized by a well de- 
fined value of the mean energy, it is impossible to assign 
a temperature to those macrostates from the viewpoint 
of the canonical ensemble. In other words, the canonical 
ensemble is blind to all mean-energy values u contained 
in the domain of nonconcavity of s{u). By decreasing 
(3 continuously through the critical value /?c, the equi- 
librium value of the energy per particle associated with 
the empirical vectors in Ep jumps discontinuously from ui 
to Uh (canonical first-order phase transition). However, 
outside the range (u/, u^) we have equivalence of ensem- 



bles, and a continuous variation of /? induces a continuous 
variation of u. 

We can go further in our analysis of the plots of Fig- 
ure 01 by noting that the phase transition exhibited in 
the microcanonical ensemble is second-order (continu- 
ous) whereas it is first-order (discontinuous) in the ca- 
nonical ensemble. This provides another clear evidence 
of the nonequivalence of the two ensembles. Again, be- 
cause the canonical ensemble is blind to all mean-energy 
values located in the nonequivalence region, only a mi- 
crocanonical analysis of the model can yield the critical 
mean energy Uc- As for the critical inverse temperature 
/3c, which signals the onset of the first-order transition in 
the canonical ensemble, its precise value can be found by 
calculating the slope of the afhne part of s**{u) or, equiv- 
alently, by identifying the point of nondifferentiability of 
(/?(/?) (see the caption of Figure QJ. 

A further characterization of /3c can also be given in 
terms of the three solutions of the equation s'{u) = /3c. 
In Figure |31 ui is the smallest of these solutions and Uh 
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u 



FIG. 3: Equivalence and nonequivalence regions for the BEG model with K = 1.0817. The solid and dashed curves are 
interpreted as in Figure El The shaded area in the bottom left plot corresponds to the region of nonequivalence of ensembles 
delimited by the open interval {ui,Uh)- The ranges of the inverse temperature and the mean energy used to draw the plots 
were chosen so as to obtain a good view of the phase transitions. 



the largest. We denote by Um the intermediate solution 
of s'{u) = (3c- Because 

s{ui) = s**{ui), s{uh) = s**{uh), (44) 

and 

s**{uh)-s**{ui) = Pc{uh-~ui), (45) 
it follows that 

/ [Pc - s'{u)] du = f3c{uh - ui) - [s{uh) - s{ui)] = 0. 
J III 

(46) 

Revifriting this integral in terms of Um, we see that 

/ [(3c-s'{u)]du^ / [s'{u)~(3c]du. (47) 

This equation expresses the equal-area property of /3c, 
first observed by Maxwell (see [al). 

To conclude this section, we present in Figure 01 a fi- 
nal series of plots of s'(u), Lp, and L" corresponding to 



K — 1.0805, a slightly smaller value than the one con- 
sidered in Figure 13 As in Figure |3| there also exists in 
Figure 21 an open interval (u/, Uh) over which s(u) is non- 
concave. For u G {ui,Uh) we consequently have nonequiv- 
alence of ensembles, illustrated by the shaded region in 
the bottom left plot. As in Figure (31 the nonequivalence 
of ensembles is associated with a first-order phase tran- 
sition in the canonical ensemble determined by Pc- The 
microcanonical phase transition in Figure 01 is also first- 
order due to the jump in s'{u) as u increases through 
the critical value Uc- By contrast, the microcanonical 
transition is second-order in Figure O 



V. CONCLUSION 

We have provided in this paper a simple illustration 
of the fact that the nonequivalence of the microcanonical 
and canonical ensembles at the thermodynamic level en- 
tails a much more fundamental nonequivalence of these 
ensembles at the level of equilibrium macrostates. Fo- 
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FIG. 4: Equivalence and nonequivalence regions for the BEG model with K = 1.0805. The solid and dashed curves are 
interpreted as in Figured The shaded area in the bottom left plot corresponds to the region of nonequivalence of ensembles 
delimited by the open interval {ui, Uh)- 



cusing our attention on the mean-field Blume-Emery- 
GrifRths (BEG) model, we showed that if the microca- 
nonical entropy s is strictly concave at a mean-energy 
value u, then the microcanonical equilibrium distribu- 
tions of states characterizing the equilibrium configura- 
tions of the BEG model at the fixed value u are realized 
canonically for inverse temperature (3 given by /3 = s'(u). 
We also showed that if the microcanonical entropy s is 
nonconcave at u, then the equilibrium distributions of 
states calculated microcanonically at the fixed value u 
are nowhere to be found in the canonical ensemble. 

This latter case of macrostate nonequivalence is illus- 
trated in Figures inland^ for various values of the param- 
eter K entering in the expression of the Hamiltonian of 
the BEG model. In each case there exists a continuum 
of nonequivalent equilibrium distributions of states that 
can be associated with mean-energy values u satisfying 
s{u) < s**(u), but cannot be associated with values of 
the inverse temperature (3, the controllable parameter of 
the canonical ensemble. These results make it clear that 
the microcanonical ensemble is richer than the canonical 
ensemble, since the latter ensemble skips over the entire 



range of mean-energy values for which the entropy is non- 
concave. As we have seen, this implies the presence of a 
first-order phase transition in the canonical ensemble. 

It should be remarked that although nonequivalent mi- 
crocanonical equilibrium distributions cannot be associ- 
ated with any value of the parameter /3, one is not pre- 
vented from assigning to these distributions a microca- 
nonical inverse temperature equal to the derivative of 
the microcanonical entropy. While this is a well-defined 
theoretical possibility, important questions are whether 
this procedure has any physical significance and whether 
such a microcanonical analog of inverse temperature can 
be measured. 

We end this paper by discussing another point related 
to nonequivalent equilibrium macrostates corresponding 
to mean-energy values at which the microcanonical en- 
tropy is nonconcave. In the course of doing our numerical 
calculations leading to the determination of the sets 5" 
and we noticed that the equilibrium distributions of 
states L" that exist microcanonically but not canonically 
are metastable macrostates of the canonical ensemble; 
i.e., they are local but not global minima of the quan- 
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tity (3u{L) — s{L). This was observed to occur when 
the mean energy u associated with these states satisfies 
s"{u) < 0. For u satisfying s"{u) > 0, we found instead 
that the macrostates L" in the region of nonequivalence 
are saddle points of (3u{L) — s{L). 

At this stage we cannot prove that this phenomenon 
holds in generality. However, our results lead us to con- 
jecture that it is valid for a wide range of statistical me- 
chanical models that have macroscopic variables satis- 
fying an LDP as in jSJ with an entropy function s(L), 
and that have an energy representation function u{L) 
satisfying (|SJ). A number of theoretical and computa- 
tional results found by Eyink and Spohn [30| and An- 
toni et al. 31], respectively, seem also to support such a 
conjecture. Besides, we know from the theory of La- 
grange multipliers that all the microcanonical equilib- 
rium macrostates are extremal points of the quantity 
(3u{L) — s{L), the global minimizers of which define the 
canonical equilibrium macrostates. In this context, what 
remains to be found is then just a way to determine the 



precise nature of these extremal points based on the prop- 
erties of s(u ) . Work aimed at elucidating this problem is 
ongoing p^ . 
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